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ABSTRACT 

Gravitational clustering is an intrinsically non-linear process that generates significant non- 
Gaussian signatures in the density field. We consider how these affect power spectrum deter- 
minations from galaxy and weak-lensing surveys. Non-Gaussian effects not only increase the 
individual error bars compared to the Gaussian case but, most importantly, lead to non-trivial 
cross-correlations between different band-powers, correlating small-scale band-powers both among 
themselves and with those at large scales. We calculate the power-spectrum covariance matrix 
in non- linear perturbation theory (weakly non-linear regime) , in the hierarchical model (strongly 
non-linear regime), and from numerical simulations in real and redshift space. In particular, we 
show that the hierarchical ansatz cannot be strictly valid for the configurations of the trispectrum 
involved in the calculation of the power-spectrum covariance matrix. 

We discuss the impact of these results on parameter estimation from power-spectrum mea- 
surements and their dependence on the size of the survey and the choice of band-powers. We 
show that the non-Gaussian terms in the covariance matrix become dominant for scales smaller 
than the non-linear scale k n i ~ 0.2 h/Mpc, depending somewhat on power normalization. Fur- 
thermore, we find that cross-correlations mostly deteriorate the determination of the amplitude 
of a rescaled power spectrum, whereas its shape is less affected. In weak lensing surveys the pro- 
jection tends to reduce the importance of non-Gaussian effects. Even so, for background galaxies 
at redshift z ~ 1, the non-Gaussian contribution rises significantly around I ~ 1000, and could 
become comparable to the Gaussian terms depending upon the power spectrum normalization 
and cosmology. The projection has another interesting effect: the ratio between non-Gaussian 
and Gaussian contributions saturates and can even decrease at small enough angular scales if the 
power spectrum of the 3D field falls faster than fc~ 2 . 



Subject headings: large-scale structure of universe; methods: numerical; methods: statistical 
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Introduction 



Discussions in the literature on the measurement of the power spectrum P(k) (e.g. Feldman et al. 1994) 



and the estimation of cosmological parameters from it (e.g. Tegmark 1997b, Eisenstein et al. 1998, Hu & 



Tegmark 1998 ), have largely focused on the case of Gaussian random data. While this is a reasonable 
assumption in the case of the cosmic microwave background for Gaussian initial conditions (e.g. Tegmark 



1997 , Bond et al. 1998 , geljak 1998 ) , it is probably not a good approximation for galaxy and weak-lensing 



surveys except on sufficiently large scales (e.g. Feldman et al. 1994, Vogeley & Szalay 1996, Hamilton 



1997| , |Colombi et al. 1998| , |5eljak 1998b| , [Tegmark et al. 1998| , |Dodelson et al. 1997| ). This is because of 



the non-Gaussianity inevitably induced by gravitational clustering, which leads to increased error bars in 
individual band-power estimates and introduces correlations between them. In this paper, we quantify the 
size of these effects and the scales at which they become important. 

Understanding the statistical properties of power-spectrum estimators beyond the Gaussian approxima- 
tion requires at least a calculation of the power-spectrum covariance matrix, which involves the four-point 
function of the density field in Fourier space, the trispectrum. In order to do this, we use both analytic 
and numerical techniques. At weakly non-linear scales, non-linear perturbation theory (PT) can be used to 
understand quantitatively how the non-Gaussian effects are generated through mode-coupling. We provide 
a calculation of the relevant configurations of the trispectrum, and use them to obtain the covariance matrix 
of band-powers. 

Non-Gaussian effects are most significant at non-linear scales, where PT breaks down. To investigate 
this regime, we resort to numerical simulations, which we use to evaluate the power-spectrum covariance 
matrix by measuring the power spectrum in several realizations. At small scales, where virialization is 
reached, stable clustering suggests a simple behavior of higher-order correlation functions, known as the 
hierarchical ansatz. We use these arguments to understand the power-spectrum covariance matrix in the 
non-linear regime, which in turn allows us to extend in a simple way our results to the projected density 
field. 

The outline of the paper is as follows. In §|2|, we introduce the notations and basic expressions for the 
covariance matrix of the 3D (three-dimensional) power-spectrum estimates. Here, as in the rest of the paper, 
we focus on the sample-variance dominated regime. In other words, shot-noise is ignored - the emphasis 
here is on the effect of non-Gaussianity induced by gravitational clustering. Shot-noise will be included in 
a separate treatment. We present estimates of the power-spectrum covariance matrix using PT, numerical 



simulations and the hierarchical model in §3.1, 3.2 and 3.3, respectively. In particular, we show in the latter 



that the hierarchical ansatz does not provide a good description of the covariance matrix in the non-linear 
regime, because there are always contributions coming from large scales (§ |3.3.1 ). Instead, we find that 
the covariance matrix behaves as a "generalized" hierarchical model, with configuration dependence that 
resembles that in PT (Figs. 9-11). On the other hand, for trispectrum configurations fully in the non- 
linear regime, we verify the validity of the hierarchical ansatz and provide some new results concerning the 
importance of different contributions (Fig. 8). 

The impact of non-Gaussianity on parameter determination is illustrated in §^ with a simple example, in 
which the amplitude of the power spectrum is the only parameter of interest. We discuss the impact on the 
determination of the power spectrum shape as well. In all of the discussions above, it is implicitly assumed 
one can measure the mass power spectrum directly. This is of course complicated by biasing in the case of 
galaxy surveys. In fact, as we shall see, on scales where non-Gaussianity is non-negligible, galaxy-biasing 



is also likely to be non-trivial. Weak gravitational lensing (e.g. Blandford et al. 1991, Miralda-Escude 
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1991, |Kaiser 1992| , |Bernardcau ct al. 19971 [Jain fc Seljak 1997| , |Kaiser 1998] |Seljak 1998b] , |van Waerbeke 
ct al. 199E) promises to be a bias-free way to measure the mass power spectrum, but as we will show in 
weak-lensing surveys in the near future will likely cover small angular scales and thus receive significant 
contributions from non-linear fluctuations to the expected signals. In §[|, we derive the appropriate expression 
for the covariance matrix of the projected mass power spectrum, and show that the projection reduces the 
non-Gaussian effects of the matter distribution, but non-negligible residuals remain. In fact, for low matter 
density models where the cluster normalization implies larger rms density fluctuations, the non-Gaussian 
contribution to the covariance can dominate over the Gaussian value ( Van Waerbeke et al. 1999 ). Finally, 
we conclude in with a discussion. 



2. Definitions 

We shall be interested in the Fourier modes of the overdensity S(x) given by, 



Their statistical properties are described by the various connected moments, 

(<5(fci)) c = 

(*(fci)*(fca)>c = fe(fci 2 ) P(fci) 

(6(ki)6(k 2 )6(k 3 )) c = fo(fe m ) B(k u k 2 ,k 3 ) 

(S(k 1 )S(k 2 )S(k 3 )S(k A )) c = 5 D (k 123i ) T(k 1 ,k 2 ,k 3 ,k i ) 

(W(fe)-%))c - &(*i2... n )T B (fc 1 ,fc a ,-..,fc Tl ), (2) 

where ki...j = fcj + • • • + kj, with 8t>(x) denoting the Dirac delta distribution. In equation (|^), P(fc), 
B(k\,k 2 ,k 3 ) and T(ki, k 2 , k 3 , k±) denote the power spectrum, bispectrum and trispectrum, respectively. 
For the purpose of this paper we will need up to the fourth connected moment. 

Let's suppose we are given a survey with volume V, from which 5(k) can be constructed. We divide the 
Fourier space into shells (or bands) of width 8k centered on fc, = i x 6k, i = 1,2, and then average the 
variance of the k modes within each shell to obtain the following estimate of the power spectrum, 



P(h) = Vf^ AJjL 5(k)8(-k) (3) 

where the integration extends over modes within the shell centered at V s (ki) = A^k\8k is the volume of 
the shell, and Vf is the volume of the fundamental cell in k space, Vf = (2ir) 3 /V. For the rest of this paper, 
we will also use Pj interchangeably with P(fcj) wherever confusion will not arise. It is straightforward to 
calculate the covariance matrix of the power spectrum estimators. Combining equations (|^) and (|^), we get 

Cy = (P(A*)P(Ai)) - (P(*i))<P(*i)) - V f [^-Jij + T{ki,kj)} (4) 
where 5ij is the Kronecker delta and T is the bin-averaged trispectrum 
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The first term in equation (^) is the Gaussian contribution. In the Gaussian limit, each Fourier mode is an 
independent Gaussian random variable. The power estimates of different bands are therefore uncorrelated, 
and the covariance is simply given by where is the number of independent Gaussian variables, 

i.e. — V s (k i )/Vf. The second term in equation (|4|) arises because of non-Gaussianity, which generally 
introduces correlations between different Fourier modes, and hence it is not diagonal in general. 

Both terms in the covariance matrix in equation (Q) are proportional to Vf, or inversely proportional 
to V, the volume of the survey. But the Gaussian and non-Gaussian contributions scale in a different way 
with V s : while the Gaussian contribution decreases with the size of the shell, the non-Gaussian term remains 
constant. Therefore, when the covariance matrix is dominated by the non-Gaussian contribution the only 
way to reduce the variance of the power spectrum is to increase the volume of the survey instead of averaging 
over more Fourier modes. 

It is worth emphasizing here that we have ignored the effect of the survey window in the above expres- 
sions. We have implicitly assumed the modes of interest have wavelengths much smaller than the size of the 
survey, which is likely to be a good approximation as we are interested in the non-Gaussianity induced by 
gravity on small scales (how small is small is a question we will address). Note also that while S(k) is not 
restricted to be the mass overdensity in the above expressions, we will assume so in the rest of the paper. 
Galaxy surveys of course only probe directly the galaxy rather than the mass overdensity. For the most part, 
we ignore galaxy biasing in this paper. We will come back to this issue in the final section. 



3. Covariance Matrix of the Band-Power Estimates in 3D 

In structure formation scenarios where the initial conditions are Gaussian, the power-spectrum covari- 
ance matrix is expected to be diagonal on sufficiently large scales, with amplitude given by the first term in 
equation (Q). Gravitational clustering, however, inevitably generates a non- vanishing trispectrum through 
non-linear mode-coupling. It is thus expected that at small enough scales, the non-Gaussian contribution to 
the power-spectrum covariance matrix (second term in equation will dominate over the Gaussian term. 

To demonstrate this, we use non-linear perturbation theory (PT) to calculate at what scales significant 
non-Gaussian contributions first appear. We then use numerical simulations to measure their effects on 
smaller, non-linear scales, where PT breaks down. In the highly non-linear regime we study the validity of 
the hierarchical model for the trispectrum, and calibrate it using N-body simulations and Hyperextended 
PT dScoccimarro fc Fricman 1998 ). 



3.1. The Power-Spectrum Covariance Matrix in PT 



In tree-level (leading-order) PT, the trispectrum is given by (see e.g. Fry 1984 ): 



T(k 1 ,k 2 ,k 3 ,k A ) = i[F 2 (k 12 , -fci) F 2 (k 12 ,k 3 ) P1P12P3 + eye] +6[F 3 (fc 1 , fe 2 ,fe 3 ) P1P2P3 + eye], (6) 
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where Pj = P(fci), with fci2 = fci + &2 as defined before, "eye." denotes cyclic permutation of the arguments 
(12 terms in total in the first contribution, and 4 terms in the second contribution), and the kernels F n 
are obtained from solving the equations of motion of gravitational instability to n th order in PT ( |Goroff et 
al. 1986 ). Equation (||) tells us that only a particular class of trispectrum configurations (parallelogram) 
contributes to the non-Gaussian covariance of the band-power estimates. Substituting the above expression 
into equation (||), we have 



f d 3 kx 


f d 3 k 2 r 


L t v s (ki) 


J kj V 8 (kj) i 



12F 3 (fei, -fci, k 2 )P?P2 + 8F 2 2 (fei - fe, kfiPQki - fc 2 |)P| 



16F 2 (fei - fca, k 2 )F 2 (k 2 - fex, fe 1 )P 1 P 2 P(|fe 1 - fca|) + (fci «-> fe 2 ) 



(7) 



Using the facts that [^2(^1 — k 2 , fc 2 )]fc 1= fc 2 = (3 + 10x)/28, where x = fci • fc 2 , and that the angular 
average ( F 3 (ki, — fci, fc 2 ) )k 1= k 2 = ~ H/378, plus the approximate angular-averaged result 

a simple approximate expression for the diagonal components of the band-averaged trispectrum follows 



T{k u h)^ — Pf, (9) 

where we have further assumed that the power spectrum is approximately constant within the shell. Note 
that the non-Gaussian terms scale as the power spectrum cubed, therefore, to be consistent within PT, 
one-loop corrections to the power spectrum in the diagonal Gaussian term must be included. 

Figure [l] shows the diagonal elements of the covariance matrix, divided by the Gaussian contribution, 
calculated using perturbation theory for a survey with V — l(Gpc/h) 3 . The cosmological model we assume 
throughout the paper is standard cold dark matter (SCDM) with eg = 0.60. The increase in the diagonal 
variance relative to the Gaussian variance is very small, less than 1% for k < 0.2, which in part reflects the 
small bin-size considered, 8k = 2ir h/Gpc. The non-linear scale, where 4irk 3 P(k) = 1, is given by k n i = 0.33. 
It is important to emphasize that figure [j] depends on the binning adopted. In other words, the relative 
importance of the Gaussian and non-Gaussian variances for each band is dependent upon the size of the 
band. 

In figure ^ we show some of the off-diagonal elements, in terms of the cross-correlation coefficient 
r.jj = Cij / y/CaCjj. Each curve in figure || corresponds to r%j as a function of k% for a fixed kj. The 
corresponding kj can be inferred from where Tjj = 1. By definition each is independent of the volume 
of the survey but it does depend on the fc-space binning. In this case, the binning has been chosen to be 
constant with the smallest possible value, Sk = 2w h/Gpc, which maximizes the Gaussian contribution. The 
correlation coefficients r^, for i ^ j, are therefore quite small. A logarithmic binning would change the 
appearance of the figures. It is important to emphasize that although the correlation coefficients are small, 
there are also many of them. In fact because the size of the correlation coefficients depends on the choice 
of band-powers, they do not have a direct physical meaning. In the limit we are considering in this section, 
where the diagonal terms of the covariance are dominated by the Gaussian contributions we have in fact 
Tij oc 8k. In §^ we will address the importance of the non-Gaussian terms in a way independent of the 
binning procedure. 
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In order to understand the origin of power spectrum cross-correlations, let us consider the integrand in 
equation (Jfy, T(k\, —kx, k2, — k 2 ) = T^ki, k 2 ), and normalize it according to 



/?12(fcl,fe2) = T 4 (fei,fe2) ^ 

where A(fc) = 4-7rfc 3 P(fc). We then obtain 



1 1 V s (h)V s (k 2 ) lT 4 (fei,fe 2 ) Sh6k 2 



2P X 2 2P| A(fc 1 )A(fc 2 ) 2 (PiP 2 ) 3/2 V fcifc 



(10) 



2PP 



^/V s {k t )V s (k 3 



+/WA(fc i )A(fc J )] ) 



(11) 



with /3jj the bin-averaged version of f3i 2 . Figure || shows the coefficient (3i 2 as a function of configura- 
tion angle 9 between fci and fe 2 for different scales, assuming constant bin size as a function of scale 
Ski = 6k 2 = 27r/100 h/Mpc. This figure illustrates how modes get correlated, the maximum amplitude 
of correlation results for co- linear configurations (8 = 0), and the least amount of correlation corresponds 
to perpendicular modes (9 — 7r/2). This is exactly what is expected for structures formed by gravitational 
instability ( Scoccimarro 1997 , [gcoccimarro ct al. 1995 ). It would be interesting to take advantage of this 
pattern of correlations to build a power spectrum estimator that minimizes the amount of cross-correlations. 



3.2. The Power-Spectrum Covariance Matrix in Numerical Simulations 

We measured the power spectrum from 20 different PM simulations of the SCDM model with as — 0.60 
and a box-size of 100 Mpc/h with 128 3 particles and a 256 3 force grid. Figure ||a shows the diagonal elements 
of the covariance matrix normalized by the Gaussian variance, obtained by computing the power spectra in 
the 20 PM simulations and performing the ensemble average. The dashed line at weakly non-linear scales 
shows the predictions of PT, equation ([)]). As predicted by PT, the increase in the diagonal covariance due 
to non-Gaussian effects is quite small, even at the non-linear scale A(k n i) = 1 (shown in the plot as a vertical 
line). We see that the diagonal terms of the covariance matrix increase rapidly relative to the the Gaussian 
contributions, once the non-linear regime is reached. 

In the bottom panel we show the same diagonal components but normalized by P, which yields the 
fractional errors in the power spectrum estimates: op/P. The fractional error decreases with increasing foj 
as k~ 2 for linear scales but does so much more slowly once we enter the non-linear regime. It should be 
stressed that the quantities shown depend both on the assumed binning in fc-space and on the volume of 
the simulation box. Unless otherwise stated, all our measurements in numerical simulations are done using 
a constant bin size, 6k = 27r/100 h/Mpc, the fundamental mode of the simulation box. On the other hand, 
at sufficiently non-linear scales the covariance matrix becomes independent of the binning (but it is still 
inversely proportional to the simulation volume). 

In figure [| we show the correlation coefficients for different scales. Note that when both ki and kj 
fall well within the non-linear regime, the correlation coefficient is close to one, which implies that the power 
estimates at the two fc-shells are highly correlated. The correlation coefficient decreases as the shells become 
further apart, but it does so very slowly. This implies that the information-gain as more modes beyond the 
linear regime are considered increases very slowly compared to the Gaussian expectations. 

Figure [| shows a comparison between the correlation coefficients obtained using PT and N-Body sim- 
ulations. The comparison is made at og — 0.375 (z — 0.6), as opposed to eg — 0-60 (z — 0) in figure |[ so 
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that more wave-modes are in the weakly non-linear regime. As one can see, there are large dispersions in 
the measurements of due to the limited number of N-body simulations, but the measured values agree 
with the prediction of PT within the errors. 

As mentioned before, the relative importance of the Gaussian versus non-Gaussian contributions to the 
covariance matrix depends upon the choice of binning in fc-space. We have tested its effect with N-body 
simulations and the behavior is as expected from equation Note that even the distinction between 
diagonal and off-diagonal covariances is somewhat arbitrary: by choosing a coarser binning of Fourier space, 
what originally appears as correlations between two nearby shells now becomes part of the diagonal variance. 
As is clear from equation (Q) , the dependence with binning becomes less important as we enter the non-linear 
regime. 

Finally, figure fj] shows correlation coefficients in redshift space. These are calculated in the plane-parallel 
approximation, for the same realizations shown before, in the z = output, erg — 0.60. The top panel shows 
Tij for the monopole of the power spectrum, whereas the bottom panel shows r%j for the quadrupole moment 
of the power spectrum. Comparison with figure || shows that the effect of correlations in redshift space is 
somewhat suppressed with respect to the real-space clustering. That is indeed expected, since the velocity 
dispersion at small scales washes out clustering and therefore non-Gaussian effects. 



3.3. The Power-Spectrum Covariance Matrix in the Non-Linear Regime 

We now study the behavior of the power-spectrum covariance matrix in the non-linear regime, where 
non-Gaussian effects are most important, and compare the numerical simulations results to those predicted 
by the hierarchical model, which reproduces the observed scaling properties of higher-order correlations in 
the non-linear regime. 



3.3.1. Contributions to the Covariance Matrix 

It is instructive to write down the trispectrum in terms of the four-point correlation function, in order to 
understand the nature of the contributions to the power-spectrum covariance matrix. Writing the connected 
four point function 771234 =< 61626364 >c, we have 

T(ki, fc 2 , fc 3 , fc 4 ) = J ^p"^p"^p" ex P H( fc i ' x u + k 2 ■ x 2 4 + k 3 • CC34)] 771234, (12) 

where x,j = Xi — Xj, and due to statistical homogeneity 771234 is only a function of three vectors, say X14, 
CE24 and 0:34, therefore k^ = — fci23. The trispectrum configurations relevant for the covariance matrix, 
equation (^|), then read 

T(fci, -fei, k 2 , -fc 2 ) = J Jq^J^yJ^y ex P H( fc i ' ^12 + fe 2 ■ Z34)] 771234- (13) 

If we restrict to the non-linear regime, where k\ and k 2 are large, we see from Eq. ( p^ ) that this restricts 
X12 and X34 to be small; however, there is no restriction on x 2 a- Therefore, the power-spectrum covariance 
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matrix in the non-linear regime receives contributions from all scales, not only those in the non-linear regime. 
As we shall see next, this feature takes a particular form in the context of the hierarchical model. 



3.3.2. The Hierarchical Model (HM) 
The hierarchical form for the trispectrum reads, 

T(k u k 2 , fc 3 , fe 4 ) = R a [P1P2P13 + eye] + R b [PiF 2 P 3 + eye], (14) 

which has been proposed to explain the scaling properties of galaxy clustering in the highly non-linear 



regime (Fry fc Peebles 1978|). In terms of the four-point function, rj(r), and the two-point function, £(r) 



the hierarchical model assumes rj{r) cx £ 3 (r), which implies T cx P 3 . Basically, the hierarchical model 
represents the trispectrum as sums of products of three power spectra, introducing only as many parameters 
as there are distinct topologies. The Rb contributions (total of 4 terms) are the "star" tree-diagrams where 
one vertex is connected to the other three, whereas the R a contributions are "snake" diagrams (total of 12 
terms). It is usually assumed that R a and Rb saturate to constants independent of scale and configuration 
in the highly non-linear regime (and this is what we will mean by the phrase HM, except when we discuss 
a modified version of HM in 



for the trispectrum (equation 



§ p.3.5| ). On the other hand, at large scales, PT predicts a hierarchical form 
|), but with R a and Rb strongly dependent on configuration, as illustrated 
in figure |^ and figure |ll| below. In our case we are interested in the configurations relevant for the power 
spectrum covariance matrix, equation (^), so 4 out of the 12 R a terms vanish, because they give -P(O) = 0. 
This can be potentially a problem, since in the limit that we approach these particular configurations some 
scales are in the linear regime, and we do not expect saturation (constant R a and Rb) to hold. As we shall 
see, this is indeed the case. 



3.3.3. The Trispectrum in the HM vs. Numerical Simulations 

Given the particular nature of the contributions to the power-spectrum covariance matrix, we will first 
discuss the validity of the HM for more general kind of configurations, were all the scales are in the non-linear 
regime. We have measured the trispectrum directly in the numerical simulations for configurations of four 
wave- vectors that form a triangle, that is, k\ — k 2l and fc 3 at an angle 9 with respect to them, for different 
ratios k 3 /k\ (fc 4 = — fem). In figure || we present two representative results, for k 3 /k\ = 1.5 and scales 
hi = 1.26 h/Mpc (top) and k\ = 1.57 h/Mpc (bottom). The results are given in terms of the hierarchical 
amplitude Q 4 defined as 

T 4 (fei, k 2 , k 3 ,-k 123 ) , , 

(15) 



[P{k 1 )P(k 2 )P(k 13 ) + eye] + [P{kt)P(k 2 )P(k 3 ) + eye] " 
The lines show the predictions of hyperextended perturbation theory (HEPT) , 

t - (1/ 2) 54- 27 2» + 23" + 6» = 12R a + 4R b 
Q* W-W 2 ) (i + 6 2»+33™ + 66™) - 16 ' (16) 



which gives the saturation value of the hierarchical amplitude Q4 in the highly non-linear regime in terms 
of the spectral index n — n(k) of the underlying linear power spectrum (Scoccimarro & Frieman 1998). 
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The spectral index has been chosen as that of the average wave-vector at each particular configuration. 
HEPT only predicts the overall amplitude of Q4; in other words, it does not attempt to model R a and Rb 
separately. In figure || we show predictions for different assumptions about the relative magnitude of R a and 
Rb, R a = Rb (solid), R a = —Rb (dotted) and 5R a = Rb (dashed). The trend is that as Rb becomes smaller 
than R a (eventually becoming negative), the configuration dependence of Q4 is opposite to that observed in 
the numerical simulations, where Q4 is slightly enhanced at 9 — over its value at 8 = it. In the opposite 
case, when Rb becomes larger than R a , an increase of Rb over R a gives a good fit to the N-body results, up 
to Rb ~ 3i? a . A stronger test of the HM would require to check different type of trispectrum configurations 
in addition to the ones we consider. Detailed testing of the HM is beyond the scope of this paper, but from 
these results we can conclude that at least for these type of configurations, the trispectrum in the non-linear 
regime agrees very well with the HM, with overall amplitude given by HEPT and weights R a ps Rb- 



3.3.4- The Power-Spectrum Covariance Matrix in the HM 

We now consider the calculation of the power-spectrum covariance matrix in the HM, and show that it 
leads to unphysical results for the cross-correlation coefficient in the limit that shells are widely separated. 

In order to get the bin-averaged trispectrum, equation (||), we need to integrate over the angle be- 
tween the two wave- vectors in the two shells under consideration. It is clear that if, say, k 2 3> fei, then 
j^ 1 (dx/2) P(\k 2 — ki\) w P(k 2 ). It turns out, however, that in practice this approximation works quite well 
in the non- linear regime even for k\ = k 2 . Thus, we get simple approximate expressions for the diagonal and 
off-diagonal contributions of the bin-averaged trispectrum 

f(ki,h) ^4(2R a + R b ) Pf. (17) 
f{k i: kj) « 2(R a + R b ) P t P] + 2(2R a + R b ) PfPj + 2R a if, (18) 

where we have assumed that ki > kj, and that R a and Rb are constants independent of configuration. These 
expressions turn out to agree with results of numerical integrations over shells to about 3% for the whole 
range of k\ and k 2 . Note that equation (|l8| ) reduces to equation ([l7j ) in the limit that fc; — >■ kj. These 
expressions imply that in the limit where the non-Gaussian contribution to the covariance matrix dominates 
{ki > kj) 



(Ra + R b ) , • , , j . 



2{2R a + R b ) 




This result implies that, unless R a = —Rb, eventually as ki 3> kj the cross-correlation coefficient r\j becomes 
larger than unity, which is unphysical. As shown in figure || a constant relation R a = —Rb is not a good fit to 
the N-body results on general trispectrum configurations. Furthermore, it turns out that a constant relation 



R a = —Rb is inconsistent with the numerical results presented in Section ^2, since it predicts that r%j decays 
much faster with shell separation than what is observed in the numerical simulations. We therefore conclude 
that a hierarchical model with amplitudes R a and Rb independent of scale and configuration is actually not 
a good description of the power- spectrum covariance matrix in the non-linear regime. 
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3.3.5. HM Predictions vs. N-body Simulations for the Covariance Matrix 

Given the results in the previous sections, we expect to find deviations from the HM predictions for the 
power-spectrum covariance matrix in the non-linear regime. We now explore the nature of these deviations 
by comparing with numerical simulations and use both PT and the HM as a tool to understand the results. 

The first non-trivial test is the scaling of diagonal elements. Both PT and the HM predict that the 
diagonal part of the trispectrum must scale as the third power of the power spectrum, equations (||) and (|l7|). 
Therefore, even though the covariance matrix in the non-linear regime receives contributions from a range 
of scales according to equation ([l3|), one expects the HM scaling to be correct. As shown in figure ^a, 
the HM scaling (solid lines) agrees with the numerical simulations (symbols). However, we find that the 
overall amplitude is a factor of five times smaller than that predicted by equation ( |l6|) for R a « Rb- This 
suppression must come from the range of scales (including large ones) that contribute in equation (|l3|), and 
therefore it is difficult to explain quantitatively without a full description of the four-point function. 

Figure ^ shows a more detailed comparison of HM predictions with the cross-correlation coefficients ry 
measured in the N-body simulations. In this calculation, we have assumed R a = Rb with Q| at (n) as given 
by HEPT. Note the very good agreement, especially around fcj s» kj. As fcj gets much larger than kj we 
see that the predicted ry start to increase, a signature that the condition R a ss Rb is breaking down, as 
discussed above. 

In light of these results, we will now use a HM with arbitrary functions R a and Rb, and explore the 
behavior of these functions implied by our N-body measurements. Note that once we give up the hypothesis 
that R a and Rb are strictly constants, we should distinguish between two kinds of R a and the R a and 
Rb in equation ( |l4| ) which depend on both configuration and scale, and the effective R a and Rb in equations 
( |l7| - |r^ ) which have the configuration dependence integrated out. We will rely on the context to differentiate 
between the two meanings here. 



Given our measurements of the power spectrum covariance matrix presented in Section 3.2, we can 
obtain estimates for R a and Rb from equations ( [l7f]l8| ). However, since the largest contribution is due to 
the first term in equation (|T§|), the best constraint is on the average value, (R a + R D )/2. This is shown in 
figure [l0|, for the particular case of kj = 0.98 h/Mpc as a function of We see that the average value shows 
a systematic decrease with shell separation. Since this configuration dependence is likely to come from scales 
not in the non-linear regime, it is useful to consider PT, which predicts a specific relation between R a and 
Rb as a function of shell separation, to see whether we can explain the decline in figure [l^. 

In figure |ll| we show the ratio Rb / R a predicted by PT as a function of configuration angle 9 between 
fci and &2 for different scales. We see that for shells close to each other, although the ratio Rb/R a varies, 
on average R a « Rb (in fact, for ki = &2 = 1 h/Mpc, the angular average yields R a m 1.5-R&). However, as 
the shells become more separated, the averaged relation between R a and Rb changes, reaching R a = —Rb in 
the limit of large separation, just what is needed in a hierarchical model to preserve the condition that the 
cross-correlation coefficient ry < 1 . 

We now examine whether the type of configuration dependence valid in PT can explain the behavior of 
(R a + Rb)/2 deduced from the N-body results. In figure [lO], the solid line shows the prediction for (R a + R )/2 
as a function of shell separation assuming that the averaged relation between R a and Rb is the same as that 
given by PT. The overall amplitude has been chosen to fit the diagonal values, at fc, = kj — 0.98 h/Mpc. We 
see that this prediction works quite well to explain the configuration dependence of the covariance matrix in 
the non-linear regime. 
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3.3.6. Summary 

To summarize, we found that the hierarchical model (HM) is not a valid description of the power- 
spectrum covariance matrix in the non-linear regime. Instead, we find that the covariance matrix behaves 
as a "generalized" HM with a configuration dependence that resembles that in PT. We interpret this as 
a consequence of the fact that even in the non-linear regime the covariance matrix receives contributions 
from large scales. On the other hand, we have tested the HM for other trispectrum configurations in the 
non-linear regime and found good agreement for R a ~ Rb and overall amplitude given by HEPT. 

Although N-body results have been restricted to the SCDM model, it is worth mentioning that hier- 
arhical amplitudes such as Q4, R a and Rb are expected to be very insensitive to cosmological parameters, 
as can be shown quite generally from the equations of motion of gravitational instability (Scoccimarro et al. 
1998, Appendix B.3). Therefore, we expect that the dependence of the covariance matrix on cosmology will 
appear only through the overall dependence on the power spectrum. 



4. Impact on Cosmological Parameter Determination 

To assess the importance of the non-Gaussian terms in the covariance matrix, suppose we are interested 
in only one parameter: the amplitude of the power spectrum over a range of scales. Let us assume that the 
shape, P(ki), is known and let us estimate the amplitude x = P(k i )/P(k i ). If only the Gaussian terms of 
the covariance were important, the minimum variance estimator for the amplitude would be, 

V, ,,. N k .P(ki)/P(ki) 

where we simply weigh the power in each bin by its inverse variance, and all bins are used up to some fc max . 
In the same Gaussian limit the variance of x would be 

a 2 x = (i 2 ) - <i) 2 = A (21) 

where N t = J2 is the total number of k modes used. 

Figure [l2] shows the ratio a 2 /(2/N t ) obtained using perturbation theory for a 1 Gpc/h box. On large 
scales, & max J;0.1 h/Mpc, PT shows that the non-Gaussian contributions do not have a significant impact 
on the determination of x, the error-bar starts increasing rapidly thereafter (for this cosmological model 
k n i = 0.33 h/Mpc, and at the smoothing scale of R ~ l/fc n ij S^a 2 /5 is of order 1). In figure |l5|a, we show 
the corresponding results from N-body simulations. In figure |l3| b we show the effective number of modes, 
defined as N c s = 2/er 2 .. As we can see, N e g varies much more slowly once we enter the non- linear regime. 

In the non-linear regime, different band-power estimates become highly correlated. What this means in 
practice is that the power at different bands fluctuate up and down together, making the overall amplitude 
of the power spectrum difficult to determine while essentially preserving information on its shape. This can 
be made more precise by considering the following covariance matrix: 

Ci^flPiPAHl-eii) (22) 

which follows from equation (Q), ignoring the Gaussian term, and from equation ()l7|). The parameter 
is some simple constant proportional to Vf. The symmetric matrix has vanishing diagonal elements, 
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eu — 0, and the off-diagonal elements determine the cross-correlations between different band-powers. We 
will consider the limit in which is small (see figure ^ , in other words all modes are highly correlated and 



Cij is nearly singular. Furthermore, as is clear from figure 
as a constant. 



is slowly varying, and we will treat 6j 



Let us consider the following rescaling of the power spectrum estimate: 



Vi 



(23) 



where the true power spectrum P(ki) is of course unknown a priori, but one can think of it instead as a 
fiducial power spectrum and we are trying to measure deviations from it. (It is actually not necessary to 
make this change of variable at all, but it will simplify some of our expressions below.) The corresponding 
covariance matrix is then 



Cf 3 = l-e(l-%) 



(24) 



We can diagonalize this matrix and ask which combinations of the band powers are better determined (see 
Hamilton 1997b for other choices of rescaling and diagonalization). There are two types of eigenvalues. 
The first is Ai = N — (N — l)e, associated with the eigenvector e% = (1, 1, 1, l)/y/N, where N is the 
number of bands. This eigenvector corresponds to the overall amplitude of the rescaled power spectrum. 
In addition, there are N — 1 degenerate eigenvectors, each with an eigenvalue A = e. The corresponding 
eigenvectors are e m = (0, 0, ...1, —1, 0..., 0)/y/2, where the 1 is in the m — 1-th position (m = 2, N). Each 
of these eigenvectors provides a measure of the local derivative of the rescaled power spectrum. Note that 
the above set of eigenvectors are linearly independent but do not form an orthogonal basis. In the case where 
e (or more precisely ) actually varies slowly with the band, but still remains small, the above conclusions 
remain largely unchanged: Ai = N + O(ey), and the rest of the eigenvalues satisfy A ~ e (e^ ~ e plus small 
variations) except that the degeneracy is lifted by the slowly varying . 

Clearly, Ai is much larger than any other eigenvalues. Hence, the combination corresponding to e\ is 
the most poorly determined. To be more precise, the fractional error in the quantity y ■ ei is 



'<[0-ei 



{y ■ e i/ 



(y ■ ei 



a + 0(e)) 



(25) 



To the extent that \J P(fcj) is slowly varying with ki, the fractional error is approximately independent of N, 
which means increasing the number of bands does not help reduce the error in this estimate of the amplitude 
of the rescaled power spectrum. The only way to reduce the error is to make (3 smaller, i.e. having a larger 
survey volume. 

On the other hand, the local spectral index of the rescaled power spectrum can be defined as y-e m /(y m ) 
(m 7^ 1), and its error is given by (we use error instead of fractional error here because the local spectral 
index could vanish): 



(y ■ e r . 



(fjn 



= VcPP» 



(26) 



which is 0(\/e) smaller than the error in the rescaled amplitude. It is intriguing that even though only two 
neighboring bands are required to estimate the shape while all bands are used to estimate the amplitude, 
it is the former that is better determined. This is a result of the high correlation limit that we have taken, 
which makes averaging over band-powers nearly useless. 
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5. Covariance Matrix of the Projected Power Spectrum: Application to Weak Lensing 

We shall now consider the statistics of the (2D) projected density field. Suppose one observes a small 
patch of the sky covering a solid angle f2. The patch is taken to be small enough that we can use the flat 
sky approximation and expand the projected mass density in Fourier modes instead of spherical harmonics. 
We will consider 

rxo 

k(6)= d X w( X ) 5(r( X )6 x ,r( X )e y ,x), (27) 
Jo 

where w(x) is a slowly varying function of the radial comoving distance to the observer x- We introduce 
the angular-diameter distance r(x), r(x) = K~ x l 2 sin K~ x l 2 x, Xi ( — K)^ 1 / 2 sinh(— K)~ 1 / 2 x for models with 
positive, zero and negative curvature respectively, and K = (flo — 1)Hq with Hq the present value of the 
Hubble constant. For weak-lensing the relevant weight function w(x) is 

i \ 3 h-2 9(Xo,x) 

w (x) = ^H n m 

/ a r( X ') r(x - x') , o^ 

9{x,x) = r~\ ' ( 28 ) 

r(X) 

in which case k is the weak-lensing convergence for background galaxies located at x — Xo- Note that our 
definition of k here is a factor of 2 larger than the one commonly used (see e.g. Jain et al. 1999). We are 
interested in the Fourier components of the projected field, from equation ( p7f ) we obtain 

which gives the two-point correlator 

(k(I)k(I')) = [dxx ^ / d X 2 ^Su(- + -)x r dk e *(xi-x») p [ J { i2/ r 2 ) + k 2 ] (30) 
J r( J r| Vri r 2 > 7„ 00 v 

where Wi = w(xi) an d similarly = r(xi)- The integral over the line-of-sight wave- vector is dominated 
by k(xi — X2) = kAx < 1, where A% must be the typical scale of variation of w(x)/ r {x) 2 (otherwise the 
total integration vanishes). In the small angle approximation, it is reasonable to assume that over the scales 
of interest w(x)/r(x) 2 is approximately constant, that is, l/r ^> 1/Ax- Therefore, l 2 /rf ^ k 2 , i.e. only 
perpendicular Fourier modes contribute to the projected field, then P{^Jl 2 jr\ + k 2 ) as P(l/r\) and the 
integral over k gives a delta function in A%. Thus, we obtain 

( k(1)k(1') ) = (2tt) fo(Z + I') J d X ^ P[l/r( X )}, (31) 



which is nothing but Limber's equation ( Peebles 1980| , Kaiser 1992] , [Kaiser 1998 ). The power spectrum 



estimator for the projected density field is accordingly 



P K (h) = A f jf «(I)«(-J), (32) 

where Af = (2ir) 2 /Q is the area of the fundamental cell in I space, and we have assumed an average over a 
ring in i-space with area A r {li) and centered at li = i X 81. Equations (|Tj) and (^) lead to 
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■/',.(/'• = I-'.t) / d X ^^P[l/r(x)}. (33) 



In figure 14 1 we show the power spectrum for the wcak-lcnsing convergence assuming that the background 
galaxies are at a redshift of z = 1. Following Jain & Seljak (1997), we use a model of the non-linear power 
spectrum proposed by Hamilton et al. (1991), and later extended by Peacock & Dodds (1994,1996) and Jain 
et al. (1995). In panel b we show the integrand of equation ( |33| ) as a function of redshift for I = 1000 and 
I = 10000. The area under the curve is proportional to P K . We conclude that the peak contribution comes 
from z = 0.3 — 0.4 depending on the I of interest, but that the integrand is a very broad function of redshift. 

The previous calculation can be easily extended to higher moments of the projected field. We obtain 



(/-,-(/,!•••/.,-(/„)),= 12m- %U, • U I d X T n \h/r(x), ln/r( X )}- (34) 



Equations ( p2[ ) and (|34j) can be used to calculate the covariance matrix of the power spectrum estimator, 
C ij = A f [ 2 ^(l-) 6ij + 
f K (h,lj) = (2 7 r) 3 |d X ^^^^^r[iJr(x),-Z 1 /r(x)J 2 /r(x),-Wr( X )]. (35) 



Before presenting the results of the numerical evaluation of the different terms in equation (|3J]) , we will 
make an order of magnitude estimate of the size of the different terms. In the process, we hope to gain some 
physical insight and understand how the different terms scale with the parameters of the problem. The ratio 
of the non-Gaussian to Gaussian terms is (for i = j) 

n =^k- ( 36 ) 



We then make the following crude approximations 



P K (l) « (2tt)A x * -J^ P[l/r( X *)} , 



f K (l,l) « (2tt) 3 A X * ^lf[l/r( X *),l/r( X *)}, (37) 

nx v 



where we have assumed that the line of sight integral is dominated by contributions from x* i A%*/2, and 
that the integrand remains roughly constant over this interval. We neglect the configuration dependence of 
the trispectrum and just use its typical value at the scale of interest. Note that both the power spectrum 
and the trispectrum are a function of time so they should be evaluated at x* ■ Under these simplifying 
assumptions the ratio becomes 

_ %Ar f 

r 2 ( X *)A X *P 2 ' [ 1 

It is interesting to compare these formulae with their analogues in 3D (equation If we make the 
identification k = l/r(x*), Afc x = 2n/Ax* and 5k = Sl/r(x*), we can rewrite the prefactor tt A r / V 2 (x*) Ax* 
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as 2TrkAk x 8k/2. Let's compare this with the relevant prefactor in the 3D case: 4irk 2 5k/2. By assumption, 
k 3> Afc x , which means that the Gaussian variance is much larger, or the ratio of non-Gaussian to Gaussian 
terms is much smaller, in 2D compared to 3D. This is a result of the fact that only the Fourier modes 
perpendicular to the line of sight contribute to the 2D projection, by virtue of an otherwise rapidly oscillating 
integrand (see derivation above). This means a far fewer number of non- linear modes are available in 2D 
compared to 3D, for a given k. While in 3D all the modes in a shell of volume Ank 2 5k are available, in 
2D only those in a ring-shaped region of area 2irk5k and height Ak x contribute to non-Gaussianity. Hence, 
projection raises the significance of the Gaussian variance relative to the non-Gaussian term. 

The projection has another interesting effect. As discussed in the context of the hierarchical ansatz, 
the trispectrum scales as the third power of the power spectrum. Contrary to what happens for the three- 
dimensional case, the number of modes only increases like k 2 = I 2 /r 2 (x) as we go to higher / (we are assuming 
51 ~ I here as it gives the correct order of magnitude estimate of the true significance of non-Gaussian terms). 
The ratio TZ scales as TZ oc k 2 P(k)/A\* instead of k 3 P(k) for the 3D case. As a consequence, if the 3D 
power spectrum decreases faster than fc~ 2 , the relative importance of the non-Gaussian variance can actually 
decrease as one goes to smaller angular scales. 

As an example we consider the weak-lensing effect on background galaxies at a redshift of z ~ 1. We 
then take x* corresponding to a redshift of z ~ 0.4, which means a distance of r(x*) = X* ~ lh _1 Gpc. 
Because the integrand is a slowly varying function of x we w iH take A%* ~ x* ■ We will concentrate on 
scales I > 1000 (9 < l/l as 3 in the correlation function) which at this distance correspond to k > 1 h/Mpc. 
Since these scales are all in the non-linear regime, to estimate the magnitude of TZ we use the value of the 
trispectrum obtained in the simulations, T ~ 17P 3 , to get 

I w_g w lh-iGpc y/ lh-iGpc w T/P 3 i^/lh^Gpch 
1000/ V1000/V r(x) ' \ A X A 17 0.5h" 3 Mpc 3 / [ ' 

This is in contrast with the 3D case where the corresponding TZ is larger than 1 for k > 1 h/Mpc. As we said 
above, projection decreases the effective number of non-linear modes which makes the Gaussian variance 
comparatively more important. 

We can compare our estimate of TZ with other measures of non-Gaussianity in simulations of weak 
lensing. Jain et al. (1999) computed the skewness of k and found S 3 a as 1.6 on a scale of 3 . To make 
the connection with our result we will assume that this number also estimates S^a 2 , as expected by simple 
scaling. Then we note that S^a 2 can be written as, 

„, m2 J d 2 hd 2 l 2 d 2 l 3 WW2WW123 T K (h,l 2 ,l 3 ,-h 23 ) /An . 

S4{R)<T [jm w 2 (m) p K (i)\ 2 ' (40) 

where Wi = W(kR), with W being the Fourier transform of the smoothing window. We see that S^a 2 
measures the same ratio of trispectrum and power spectrum that is relevant for TZ but with one caveat. 
S4& 2 is determined by an average over all trispectrum configurations while TZ is sensitive to those particular 



TZ - 0.2 



shapes relevant for the covariance matrix, and as we discussed in §3.3.5, the latter is smaller than the naive 
expectation by a factor of five. Thus we expect S^cr 2 /5 to be comparable to TZ, as indeed it is. 

Also note that the weight function w{x) cancels out in the ratio TZ, so the above conclusions apply equally 
well to other types of projection, such as that in angular galaxy surveys. The non-Gaussian contribution 
to the error is comparable but does not dominate over the Gaussian contribution for the cluster-normalized 
SCDM model, and for mass distributions projected over cosmological distances (~ lh _1 Gpc). For angular 
surveys, however, the effects of non-Gaussianity are likely to be larger than in the weak-lensing case, since 
the projection takes place over a smaller range of scales. 
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In figure p_5j we show the result of numerically integrating equation (pq) . We use here the hierarchical 



form together with HEPT discussed in §3.2 , but rescale the amplitude of R a (« Rb) to match the amplitude 
of the power spectrum variance measured in the simulations (T/P 3 = 17 at k = lhMpc -1 for the SCDM 
model) . The figure shows the ratio of the diagonal terms of the covariance so this value is all that is needed 
to normalize the calculation. This should be a good approximation for most of the scales shown, but would 
start to break down close to I ~ 100, in which case the PT result would begin to take over (see equation 
||). We show the results for three cosmological models, the SCDM model we have been considering so far 
(as = 0.60, r = 0.5), an open model with f2 m — 0.3, Y = 0.21 and as — 0.85, and a flat cosmological 
constant (A) dominated model with f2 m = 0.3 and Qa = 0.7. We have assumed the the hierarchical ratios 
R a and Rb are independent of the cosmological model, which should hold to a very good approximation 



( Scoccimarro et al. 1998 ) 



The ratio 1Z obtained in the numerical integration for SCDM is in agreement with our simple estimate 
in equation ([391) . Furthermore we can see that the ratio has a maximum around I ~ 8000, which is a 
consequence of the fact that the 3D power spectrum decreases faster than k~ 2 therefater. Even though we 
are probing scales that are more non-linear as we go to higher I, the projection actually makes the estimates 
of the power spectrum of k more Gaussian, since the number of non-linear modes does not increase as 
fast as to compensate for the decaying power spectrum. Therefore, contrary to what happens in 3D, the 
non-Gaussian effects never dominate even at very small scales for the SCDM model. 

Figure [l5] shows that the non-Gaussian effects can be more important in other models, however. The 
main difference between models arises due to their different normalizations, parametrized by as (in all 
cases, the normalization is chosen to yield the correct cluster abundance today). An additional effect which 
enhances the signature in the open model is the difference in the fluctuation growth rates. The parameter 
as specifies the normalization at the present time but the lensing effect is sensitive to the amplitude of the 
fluctuations all the way up to the redshift of the background galaxies. In the open model structure grows 
slower than in the flat one, so for a given normalization today the average power up to a redshift z ~ 1 
is significantly larger. The growth rate of the cosmological constant model is intermediate between that 
of the open and flat models. Hence, although the open and cosmological constant models have the same 
present normalization, the size of the ratio in figure ^| differs for the two models. Finally, there is one more 
differentiating factor: the angular-diameter distance to the relevant redshifts is largest in the A model, which 
tends to further reduce TZ (equation J39j). 

Our results for different cosmologies agree roughly with the numerical investigations carried out by van 
Waerbeke et al. (1998). We generally find, however, a smaller (by a factor of order two) non-Gaussian 
contribution than they do. The source of this discrepancy is likely the different treatment of the dynamics 
of gravitational clustering. As described above, we use as an approximation the hierarchical model for the 
diagonal non-Gaussian contribution; on the other hand, van Waerbeke et al. (1998) use the dynamics of 
second-order Lagrangian perturbation theory in 2D. 

In summary, we have shown that the non-Gaussian terms in the diagonal of the projected power spec- 
trum, which arise due to the non-linear nature of gravitational clustering, can be the dominant source of 
error in some models, depending on the normalization and cosmology. These terms should be included 



when analyzing future surveys or when trying to predict their capabilities (van Waerbeke et al. 1999, Hu & 



Tegmark 1998). Perhaps more importantly, as in the three-dimensional case, gravitational clustering induces 
correlations between different band-powers, in addition to increasing their individual error-bars. Calcula- 
tions similar to those performed for figure |] are straightforward to carry out for weak-lensing, but will not 



be included here. 
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6. Discussion 

We have analyzed the covariance matrix for band-power estimates, both for the three-dimensional 
matter density and its angular projection. Three general statements can be made. First, the non-linear 
nature of gravitational clustering tends to increase the diagonal variance over the Gaussian error as well as 
induce correlations between different band-powers. Second, for scales k where 4irk 3 P(k) < 1, the covariance 
matrix is reasonably well approximated by its Gaussian part; conversely, the Gaussian approximation rapidly 
breaks down on scales 4irk 3 P(k) > 1. Third, and interestingly band-powers on non- linear scales are actually 
significantly correlated with band-powers on quasilinear scales (see e.g. figure ^); this is because the growth 
of wave modes on small scales is significantly affected by the presence of long-wave modes. 

We have also discussed how the relative importance of Gaussian and non-Gaussian terms is somewhat 
dependent upon the choice of binning in /c-space, because the binning affects the Gaussian covariance but 
not the non-Gaussian one (equation Coarse graining in fc-space lowers the Gaussian variance, while 

increasing the survey size decreases the overall amplitude of the covariance matrix. However, there are 
binning-independent ways to quantify the relative importance of non-Gaussian versus Gaussian variances, 
such as by focusing on the estimation of the amplitude of the power spectrum (see . In this particular 
case, one is essentially using a very coarse bin which includes all wave-modes. This is the reason why 
simple estimates such as the one given in equation (|o|) work as a rough guide as to the importance of 
non-Gaussianity. In the highly non-linear regime where all wave modes are highly correlated, the amplitude 
of a rescaled power spectrum (equation p3|) is more poorly determined compared to its shape. The key to 
decreasing error-bars in this case is to increase the survey volume, rather than adding more wave-bands. 

Angular projection adds interesting new twists to the above general picture. We have studied the 
projection relevant for weak lensing in more detail, but most of our conclusions should hold for other 
projections as well since the relevant weight function w(x) (equation p7j| ) drops out of the non-Gaussian- 
to-Gaussian ratio TZ (equation |3^| ). Projection tends to reduce the relative importance of non-Gaussianity, 
as fewer non-linear modes contribute than in 3D. In particular, for the cluster-normalized SCDM model, 
even though the non-Gaussian terms eventually dominate the covariance of the 3D power spectrum on small 
enough scales, they never dominate for the 2D case, though they are comparable to the Gaussian terms on 
certain angular scales. However, the amplitude of the effects we study is very sensitive to the cosmological 
model, in particular to the normalization er§ and the fluctuation growth rate. Cluster-normalized low matter 
density models tend to show stronger signs of non-Gaussianity, and open models even more so than A 
models. We find that for most reasonable models the non-Gaussian terms in the covariance grow significantly 
compared to the Gaussian terms around I ~ 1000 which corresponds to 9 ~ 3 . They even dominate for the 
open CDM model. This will have a significant impact on the analysis of future weak-lensing surveys. 

To get a handle on the size of the various effects we are interested in, we have used perturbation theory 
for the linear or quasilinear scales, and the hierarchical ansatz and numerical simulations for the non- linear 
regime. As an outgrowth of this investigation, we have shown that the hierarchical ansatz cannot be valid 
with R a and Rb strictly constant, for the particular type of trispectrum configurations relevant for the 
power-spectrum covariance. If this simple model were correct, then the correlation coefficient (equation 
||) would become larger than one for two widely separated shells, unless R a = —Rb- But it turns out that 
R a = —Rb gives the wrong shape of as a function of shell separation. Hence, the hierarchical ansatz 
cannot be valid in its simplest form. We argue this failure is a consequence of the fact that even in the non- 
linear regime the covariance matrix receives contributions from large scales, and show how modifications of 
the hierarchical ansatz based on PT can be made to give predictions that match our numerical results. 
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There are at least two sets of issues we have left untouched here. First, we have ignored the effect of 
biasing in our calculations. A simple linear biasing is of course trivial to include. In this case the relevant 
ratio of Gaussian to non-Gaussian terms in the covariance is V s T g /2P g = V S T/2P 2 oc k 3 P = k 3 P g /b 2 , 
assuming the hierarchical ansatz, where P g and T g stand for the power spectrum and trispectrum of the 
galaxies and b is the linear bias parameter. Clearly the non-Gaussian effects we study are induced by gravity, 
so the relevant quantity that governs their amplitude is the size of the matter, not galaxy, fluctuations. The 
value of k where the error-bars start to become larger than what is expected for a Gaussian random field 
marks the non-linear scale. Thus if k 3 P g is different from one at this scale we can conclude that there is 
a significant bias between the mass and galaxy fluctuations, and in the simple linear bias model we could 
in principle try to measure b through this effect. In reality, however, the biasing relation is likely to be 
complicated by non-linearity and stochasticity, especially on scales where the non-Gaussian covariance is not 
negligible. 

Lastly, we have focused throughout this paper on the sample-variance dominated regime. In reality, 
shot-noise, either due to the discrete nature of galaxies or to their random orientations, might be non- 
negligible. Moreover the survey window will be important for scales approaching the size of the survey. It 
is relatively straightforward to generalize our expressions to include these effects. An interesting question is 
how to obtain the optimal weighting of the data in the presence of non-Gaussianity. This will be presented 
in a separate paper. 
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Fig. 1. — Diagonal elements of the power spectrum covariance matrix normalized by the Gaussian variance 
for SCDM with ag — 0.60 (z = 0). The centers of the shells are given by ki — (2iri) h/Gpc, and width 
5k = 2n h/Gpc. For this model the non-linear scale is k n i = 0.33 h/Mpc. 
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Fig. 2. — Correlation coefficients = Cij /{CuCjj} 1 / 2 as a function of ki for different fixed kj obtained 
using PT for a 1 Gpc/h box, and the SCDM model. The k shells are the same as in figure |l| The values of 
kj can be inferred from the place where nj — 1 . In each panel we plot the results for two different values of 



kj. 
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Fig. 3. — The coefficient ^12(^1, fe, 6*) [equation ([To|)l as a function of 9 for different scales. From top to 
bottom, (h,k 2 ) = (0.1,0.4) h/Mpc, (0.1,0.2) h/Mpc, (0.2,0.4) h/Mpc, and (0.4,0.8) h/Mpc. 



-23 - 




Fig. 4. — The top panel shows the diagonal elements of the covariance matrix normalized by the Gaussian 
variance, obtained by comparing 20 PM simulations. The dashed line shows the predictions of PT, equa- 
tion ([}]), and the solid line the hierarchical scaling, equation (0). The bottom panel shows the fractional 
error in the band-power estimates. This fractional error scales with the size of the survey or simulation box, 
the results in the figure correspond to a volume Vq = (100 h _1 Mpc) 3 . Results for other volumes can be 
obtained by scaling by (Vq/V) 1 ^ 2 . The vertical line on the x axis indicates the non- linear scale. The width 
of shells in A:-space is 8k = 27r/100 h/Mpc. 
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Fig. 5. — Correlation coefficients from the N-body simulations as a function of ki for a fixed kj. The two 
sets of points (triangles and squares) in each plot correspond to different kj's. The different values of kj can 
be deduced from the place where = 1. The mark on the x axis indicates the non-linear scale. The width 
of shells in A:-space is 5k = 47r/100 h/Mpc. 
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Fig. 6. — A comparison of ry obtained from PT and N-body simulations, at erg = 0.375 (z — 0.6). Note the 
noise level in the covariance matrix measured from the numerical simulations; this is due to the low number 
(20) of realizations used to estimate the ensemble averages. The width of shells in fc-space is 6k — 27r/100 
h/Mpc. 
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Fig. 7. — Cross-correlation coefficients m obtained N-body simulations in redshift space, for ag, = 0.60. The 
top panel shows ry for the monopole of the power spectrum, whereas the bottom panel shows ry for the 
quadrupole power spectrum. The width of shells in fc-space is Sk = 87r/100 h/Mpc. 
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Fig. 8. — The trispectrum hierarchical amplitude Q4 for configurations k\ = hi and ^3 = 1.5fci as a function 
of the angle cos(#) = k\ ■ k\ in the non-linear regime, for two different scales, ki = 1.26 h/Mpc (top) and 
ki = 1.57 h/Mpc (bottom). The lines show the predictions of HEPT (equation |l6|) plus assuming R a = Rt, 
(solid), R a = —Rb (dotted) and 5R a = Rb (dashed). The numerical simulations results are denoted by 
symbols with error bars determined from 20 realizations. 
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Fig. 9. — Correlation coefficients Tij in the non-linear regime predicted by HEPT (solid lines) for R a ss Rb 
compared with those from numerical simulations (symbols). The width of shells in fc-space is 8k = 47r/100 
h/Mpc. 
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Fig. 10. — Estimate of (R a + i?h)/2 from the measured power-spectrum covariance matrix, for kj = 
0.98 h/Mpc, as a function of ki, using equations (|l7|-^8|). The solid lines show the expected (R a + Rb)/2 
value assuming that the configuration dependence in the non-linear regime is the same as that given by PT. 
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Fig. 11. — The ratio Rt/R a given by PT as a function of configuration angle 6 between k\ and k 2 for different 
scales. From top to bottom, (h,k 2 ) = (1,2) h/Mpc, (1,4) h/Mpc, (1,8) h/Mpc, and (1,16) h/Mpc. Note 
how the relation R a = —Rb eventually holds for most configurations as the shells become very distant. 
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Fig. 12. — Variance in x as a function of k ma x- Note that the signature of non-Gaussianity is much larger 
here than what could be seen in the diagonal terms coefficients shown in figure [l[ This is a result of the 
cross-correlations between bins. 
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Fig. 13. — Panel (a) shows the variance of x measured using the N-body simulations. Panel (b) shows the 
effective number of modes. The results are for a% — 0.60 (z = 0) and the non-linear scale is shown with a 
mark on the x axis. 
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(1+z) 

Fig. 14. — Panel (a) shows the power spectrum of the projected mass density for the cluster-normalized 
SCDM model. The sources are assumed to be located at z = 1. The bottom panel illustrates where the 
contribution is coming from in redshift. The area under the curve is proportional to the projected power 
spectrum, the normalization is arbitrary. 
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Fig. 15. — Ratio of the non-Gaussian to Gaussian terms in the covariance of the projected mass density, for 
cluster-normalized SCDM (solid line), ACDM (dot-dashed line) and open CDM (dashed line) models. The 
hierarchical ansatz was assumed in the calculation, with R a rs R\,. See text for details. 



